Deceased Humpback Whale - Joshua Meza-Fidalgo, 2020

Deceased Humpback Whale - Joshua Meza-Fidalgo, 2020


Rationale and Research Questions

Marine mammal and sea turtle strandings can be unusual events or indicators of problems in our ocean ecosystems. Seeing patterns in the strandings of marine mammals and sea turtles can be indicative of more complex issues that may not be transparent. Some example of these issues could be viral outbreaks in lower trophic level populations or toxic algal blooms that kill off all sea life in the area. Because humans demand and consume seafood, understanding the problems that arise are essential to preventing the consumption of contaminated seafood along with the management of fisheries. By studying marine mammal and sea turtle strandings, we get insight on the health of our oceans.

Our objective is to analyze animal strandings over the years to see if there any trends in the quantity of strandings for each family (cetaceans, odontocetes, mysticetes, and pinnipeds). We have decided to look at the total number of strandings per year for all families, the changes in strandings over the years for each family and the seasonality differences across all years for each family.

Dataset Information

Table 1: Dataset Information
Detail Description
Data Source OBIS-SEAMAP - Mystic Aquarium
Retrieved From https://seamap.env.duke.edu/dataset/945
Variables Used Family, Common Name, Species Name, Date, Year, Month, Latitude, Longitude, Total Strandings, Strandings by Species
Data Range March 18, 1976 to December, 29, 2011

The dataset that we chose to work with is the Mystic Aquarium’s marine mammal and sea turtle stranding data that started in 1976 until 2011. However, there was only one stranding record in 1976, so that point was excluded from our analysis. The rest of the data started in 1990 and was concluded in 2011. It contains the number of whale (odontocetes and mysticetes), pinniped, and sea turtle strandings that occurred along the coasts of Connecticut, Rhode Island and Fishers Island, New York, USA. There were a total of 1140 strandings; within these strandings, 155 were odontocetes, 50 were mysticetes, 679 were pinnipeds, and 256 were sea turtles.

The dataset has a latitude that ranges from 41.00 to 41.86 while the longitude ranges from -73.65 to -71.10. We data was provided using the datum WGS 1984 (4326) and we performed a data transformation into UTM 19 (32619) for geospatial analysis. The data was wrangled to contain only the variables that we were interested in using. We then selected parts of the data based off of the Common Name of the species to create datasets that were easier for us to use in our analysis. This helped us divide the data into four groups: odontocetes, mysticetes, pinnipeds, and sea turtles. We initially looked at total yearly strandings for trends in the data for each family. After we analyzed the data on a monthly basis to help give insight to seasonal stranding trends for each family.

Figure 1: The Column Names of the Raw Data
##  [1] "dataset_id"      "row_id"          "latitude"        "longitude"      
##  [5] "species_name"    "scientific_name" "common_name"     "itis_tsn"       
##  [9] "group_size"      "series_id"       "date_time"       "timezone"       
## [13] "ds_type"         "platform"        "provider"        "lprecision"     
## [17] "tprecision"      "oceano"          "notes"           "last_mod"
Figure 2: The Column Names of the Processed Data
## [1] "Family"       "common_name"  "species_name" "Date"         "Year"        
## [6] "Month"        "latitude"     "longitude"
Table 2: The First Rows and Columns of the Final Processed Data
Table 3: The First Rows and Columns of Strandings of Each Family per Year
Table 4: The First Rows and Columns of Strandings of Each Family per Month

Exploratory Analysis

We initially decided that it would be best to look at the data holistically to get a better feel for the trends in the data. We first analyzed the total number of strandings per each family, as seen in Figure 3. To better see the data, we divided summed the strandings of each family for each year from 1990 to 2011, as seen in Figure 4. To see if there could be seasonal trends for the entire data set, the data was grouped by month for each family from the years 1990 to 2011, as seen in Figure 5. Finally, to have a geospatial understanding of our data, Map 1 was created.

Figure 3: Total Strandings per Family

Figure 4: Total Strandings per Year

Figure 5: Total Strandings per Month

Map 1: Geospatial Exploration of each Family

Analysis

NOTE– Did we ever do an analysis for the entire data set without dividing it by family????

Because we decided to divide the data into groups by their families, we conducted separate analysis for each of the families. Our research questions are the following:

  1. Pinnipeds:

  2. Whales: -Odontocetes

#Odontocete Data Exploration

##          Family                          common_name
##  Mysticetes :  0   Short-beaked Common Dolphin :52  
##  Odontocetes:155   Harbor Porpoise             :38  
##  Pinnipeds  :  0   Atlantic White-sided Dolphin:14  
##  Turtles    :  0   Long-finned Pilot Whale     :13  
##                    toothed whales              :12  
##                    Striped Dolphin             : 9  
##                    (Other)                     :17  
##                        species_name      Date                 Year     
##  Common dolphin              :52    Min.   :1990-02-10   Min.   :1990  
##  Harbor porpoise             :38    1st Qu.:1999-05-20   1st Qu.:1999  
##  Atlantic White Sided Dolphin:14    Median :2004-09-05   Median :2004  
##  Long-finned pilot whale     :13    Mean   :2003-08-24   Mean   :2003  
##  Unidentified Odontocete     :12    3rd Qu.:2009-02-16   3rd Qu.:2009  
##  Striped dolphin             : 9    Max.   :2011-12-16   Max.   :2011  
##  (Other)                     :17                                       
##      Month           latitude       longitude     
##  Min.   : 1.000   Min.   :41.00   Min.   :-73.65  
##  1st Qu.: 3.000   1st Qu.:41.30   1st Qu.:-71.58  
##  Median : 5.000   Median :41.42   Median :-71.45  
##  Mean   : 6.065   Mean   :41.40   Mean   :-71.55  
##  3rd Qu.: 8.000   3rd Qu.:41.49   3rd Qu.:-71.30  
##  Max.   :12.000   Max.   :41.77   Max.   :-71.13  
## 

There appears to be an increasing trend in the number of odontocete strandings as the years progress. There may be possible seasonal trend based on season, where the spring and early summer see a greater number of strandings than the other seasons.

#Odontocete Analysis ##Statisical Analysis

Null Hypothesis 1: There is no effect of year on the total number of odontocete strandings Alternative Hypothesis 1: There is an effect of year on the total number of odontocete strandings

Null Hypothesis 2: There is no effect of month on the total number of odontocete strandings Alternative Hypothesis 2: There is an effect of month on the total number of odontocete strandings

## Call:
##    aov(formula = Odontocetes_Total ~ Year, data = annual_odontocetes_strandings)
## 
## Terms:
##                      Year Residuals
## Sum of Squares  195.90316  91.05138
## Deg. of Freedom         1        20
## 
## Residual standard error: 2.133675
## Estimated effects may be unbalanced
##             Df Sum Sq Mean Sq F value   Pr(>F)    
## Year         1 195.90  195.90   43.03 2.16e-06 ***
## Residuals   20  91.05    4.55                     
## ---
## Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
## Call:
##    aov(formula = Odontocetes_Total ~ Month, data = monthly_odontocetes_strandings)
## 
## Terms:
##                    Month Residuals
## Sum of Squares   31.8619  459.0548
## Deg. of Freedom        1        10
## 
## Residual standard error: 6.775358
## Estimated effects may be unbalanced
##             Df Sum Sq Mean Sq F value Pr(>F)
## Month        1   31.9   31.86   0.694  0.424
## Residuals   10  459.1   45.91

We reject the null hypothesis that there is no effect of year on the total number of odontocete strandings due to a p-value of 2.16e-06. We also fail to reject the null hypothesis that there is no effect of month on the total number of odontocete strandings due to a p-value of 0.424. This indicates that there is a difference in the number of odontocete strandings across the years of the study, but not across the months. This could be explained by increase boat and shipping traffic as the years progress that contribute to increased ocean noise and interference with their communication that have led them to become stranded. Seasonality may not have an effect because of the consistency of boat and ship traffic throughout each year.

#Geospatial Analysis

Knowing where the “mean” stranding location is could allow for a study of that area in order to determine why odontocete strandings are so prominent on this area and allow for the allocation of more recovery resources and personnel to that area to compensate for the higher levels of strandings. Much of the strandings occurred from Charlestown to Newport, in the Rhode Island Sound.

distance_group totals_by_distance
140,000 - 160,000 1
60,000 - 80,000 1
120,000 - 140,000 2
> 160,000 3
100,000 - 120,000 3
40,000 - 60,000 20
< 40,000 125

#Temporal Analysis Null Hypothesis: The odontocetes stranding data is stationary Alternative Hypothesis: The odontocetes stranding data is not stationary

## WARNING: Error exit, tauk2. IFAULT =  12
## WARNING: Error exit, tauk2. IFAULT =  12
## WARNING: Error exit, tauk2. IFAULT =  12
## WARNING: Error exit, tauk2. IFAULT =  12
## WARNING: Error exit, tauk2. IFAULT =  12
## WARNING: Error exit, tauk2. IFAULT =  12
## WARNING: Error exit, tauk2. IFAULT =  12
## WARNING: Error exit, tauk2. IFAULT =  12
## WARNING: Error exit, tauk2. IFAULT =  12
## Score =  12 , Var(Score) = 235.3333
## denominator =  102.2623
## tau = 0.117, 2-sided pvalue =0.43407

Because the 2-sided p-value was 0.434, we fail to reject the null hypothesis and say that odontocetes monthly stranding data is stationary.

## Score =  -11194 , Var(Score) = 2672700
## denominator =  39279
## tau = -0.285, 2-sided pvalue =7.5333e-12

Because the 2-sided p-value was less 7.53e-12, we reject the null hypothesis and say that odontocete yearly stranding data is not stationary.

3. Sea Turtles:

Turtle Data Exploration

##          Family                    common_name 
##  Mysticetes :  0   Leatherback Sea Turtle:176  
##  Odontocetes:  0   Loggerhead Sea Turtle : 63  
##  Pinnipeds  :  0   Kemp's Ridley         :  9  
##  Turtles    :256   Green Sea Turtle      :  4  
##                    Turtles               :  4  
##                    Atlantic gray seal    :  0  
##                    (Other)               :  0  
##                               species_name      Date                 Year     
##  Leatherback - Dermochelys coriacea :176   Min.   :1990-09-15   Min.   :1990  
##  Loggerhead - Caretta caretta       : 63   1st Qu.:1995-08-15   1st Qu.:1995  
##  Kemp's ridley - Lepidochelys kempii:  5   Median :2000-09-13   Median :2000  
##  Kemp's Ridley - Lepidochelys kempii:  4   Mean   :2001-05-31   Mean   :2001  
##  Unknown/Unidentified               :  4   3rd Qu.:2008-02-12   3rd Qu.:2007  
##  Green turtle - Chelonia mydas      :  3   Max.   :2011-09-22   Max.   :2011  
##  (Other)                            :  1                                      
##      Month          latitude       longitude     
##  Min.   : 2.00   Min.   :41.00   Min.   :-73.57  
##  1st Qu.: 8.00   1st Qu.:41.27   1st Qu.:-71.78  
##  Median : 8.00   Median :41.35   Median :-71.55  
##  Mean   : 8.34   Mean   :41.36   Mean   :-71.66  
##  3rd Qu.: 9.00   3rd Qu.:41.47   3rd Qu.:-71.36  
##  Max.   :12.00   Max.   :41.75   Max.   :-71.10  
## 

Exploratory analysis of the sea turtle stranding data set by year shows higher amounts of strandings in the early 1990s and late 2000s, with the highest amount in 1995. The most prominent species throughout our study period appear to be leatherback sea turtles, followed by loggerhead sea turtles. Through monthly data exploration, strandings peak from July to September and occur less frequently the remainder of the year.

Turtle Spatial Analysis

Turtle Spatial Table

Distance (m) Number of Strandings
less than 20,000 92
20,000 - 40,000 113
40,000 - 60,000 32
60,000 - 80,000 2
80,000 - 100,000 4
100,000 - 120,000 2
120,000 - 140,000 6
140,000 - 160,000 4
greater than 160,000 1

Of the 256 sea turtle strandings used in our analysis, 113 of them occured within 20,000 - 40,000m of the mean stranding location, followed by 92 strandings within 20,000m of the mean location (Table 1). These points were in the areas around Charlestown, RI, Newport, RI and New Shoreham, RI. This high concentration of strandings can indicate that these areas may be sea turtle stranding hotspots. This could be due to various factors, such as the flow of oceanic currents, anthropogenic impacts (such as higher fishing presrure), or the fact that there may be more manpower employed in these areas that could make stranding detection easier. Further research is needed to determine how these underlying factors may be contibuting to sea turtle strandings in these areas.

Turtle Statistical Analysis

Null Hypothesis 1: There is no effect of individual year on sea turtle strandings.
Alternative Hypothesis 1: There is an effect of individual year on sea turtle strandings.

Null Hypothesis 2: There is no effect of month on sea turtle strandings.
Alternative Hypothesis 2: There is an effect of month on sea turtle strandings.

Yearly Analysis: We reject the null hypothesis that individual year has no effect on sea turtle strandings. The sea turtle strandings in 9 of the 21 years are not significantly different (p-values: 0.147 - 0.706). 1995 was the most significantly different (p-value: 0.000236). 1991 (p-value: 0.0056), 1993 (p-value: 0.0078), 2008 (p-value: 0.0022), 2010 (p-value: 0.0016), and 2011 (p-value: 0.003) were the next most statitistically different years. The null deviance of this model is 7.4606e+01, which indicates ??? The residual deviance of this model is -4.8850e-15, which indicates ???

Monthly Analysis: We reject the null hypothesis that month has no effect on sea turtle strandings. Of the 12 months, 5 of months were significantly different. May (p-value: 0.000269), June (p-value: 3.98e-06), July (p-value: 1.91e-05), August (p-value: 0.004106), and September (p-value: 0.049935) were significantly different. This indicates that sea turtle strandings are concentrated to the summer months in our study area. The null deviance of this model is 4.9235e+02, which indicates ??? The residual deviance of this model is 3.0332e-10, which indicates ???

Turtle Time Series

Null hypothesis:
Alternative hypothesis:

Summary and Conclusions


Deceased Fin Whale - Joshua Meza-Fidalgo, 2020

Deceased Fin Whale - Joshua Meza-Fidalgo, 2020


References

Data: https://seamap.env.duke.edu/dataset/945

  1. Halpin, P.N., A.J. Read, E. Fujioka, B.D. Best, B. Donnelly, L.J. Hazen, C. Kot, K. Urian, E. LaBrecque, A. Dimatteo, J. Cleary, C. Good, L.B. Crowder, and K.D. Hyrenbach. 2009. OBIS-SEAMAP: The world data center for marine mammal, sea bird, and sea turtle distributions. Oceanography. 22(2):104-115.

  2. Smith, A. 2014. Mystic Aquarium’s marine mammal and sea turtle stranding data 1976-2011. Data downloaded from OBIS-SEAMAP (http://seamap.env.duke.edu/dataset/945) on 2022-04-02.

Photos: Special thanks to Joshua Meza-Fidalgo for providing whale photos for us to use.